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Costs  and  Payoffs  in  Pe.  ceptual  Research1 
Detlof  von  Winterfeldt  and  Ward  Edwards 

Engineering  Psychology  Laboratory 
University  of  Michigan 
Ann  Arbor,  Michigan 


I. 


ii 


Styles  in  perceptual  research  change,  reflecting  the  changing 
styles  of  psychological  research  and  theorizing.  Consider  the  problem 
faced  by  every  perceptual  researcher,  of  validating  his  subject's 
responses- that  Is,  of  finding  persuasive  reasons  to  believe  that  the 
behavior  emitted  by  the  subjects  in  some  sense  faithfully  reports  the 
inaccessible  processes  or  experiences  that  have  just  gone  on  inside  that 
subject's  head.  (This  question  readily  translates  into  the  language  of 
empty-organism  psychology,  and  has  the  same  meaning  there;  we  leave  the 
spelling  out  of  that  translation  to  others  more  masochistic  than  we.) 

In  the  days  of  Wundt  and  Titchener,  the  persuasive  reason  was 
I’Kely  to  take  the  form  ''The  subject  was  carefully  trained  in  my 
laboratory  to  report  accurately;  moreover  I  tried  it  myself  and  had 
the  same  experience.''  The  attack  runted  by  the  testa, t  psychologists 
and  the  behaviorists  on  this  sort  of  truth-by-gent, enmn's-agreement 


was  devastating,  and  proved  fatal.  Obviously,  in  two  years  you  can 
tram  a  willing,  intelligent  graduate  student  to  report  virtually 
anything  that  seems  appropriate  to  you,  in  whatever  language  is 
pleasing  to  your  ear. 

The  Gestalt  psychologists  emphasized  very  simple,  intuitively 
compelling  demonstrations  that  anyone  could  see.  This  was  fine,  as 
far  as  It  went.  But  not  all  perceptual  questions  can  be  resolved  by 
means  of  such  demonstratlons-and  even  If  a  demonstration  (for  example, 
of  brightness  constancy)  Is  Intuitively  compelling  to  all.  attempts  to 
measure  the  magnitude  of  the  effect  produce  the  usual  Individual  differences, 
and  so  raise  the  same  old  question  about  how  to  establish  an  orderly  rela¬ 
tion  between  responses  and  the  experiences  or  processes  underlying  them. 

The  Stevens-Garner  controversy  of  the  late  '40s  and  early  '50s 
presented  a  more  modern  version  of  the  structuralists'  dilenma.  Responses 
that  demand  a  very  high  degree  of  training  to  use,  such  as  direct  magnitude 
estimation,  did  not  agree  with  responses  that  produce  much  greater  inter¬ 
personal  agreement,  with  much  less  training,  such  as  those  based  on 
dlscrlmlnabillty.  Was  the  discrepancy  Induced  by  the  training,  or  were 
the  two  classes  of  procedures  tapping  different  kinds  of  psychological 
mechanisms?  In  the  latter  case,  which  was  the  "true"  representation  of, 
say,  loudness?  Or,  more  operationally,  which  was  more  useful  for  applica¬ 
tions  like  design  of  hearing  aids,  measurement  of  Industrial  noise,  etc.? 


In  the  mid- ! 50s,  a  new  approach  to  this  problem  of  validation 
entered  the  psychological  literature.  It  did  not  apply  to  all  perceptual 
issues,  but  it  presented  an  apparently  conclusive  solution  to  the  validation 
problem  for  all  issues  to  which  it  did  apply.  The  idea  was  very  simple: 
arrange  the  circumstances  of  the  experiment  so  that  it  is  in  S!s  self- 
interest  to  generate  a  response  that  depends  in  an  orderly  way  on  the 
internal  experience  or  process  being  studied,  make  sure  that  he  understands 
the  nature  of  this  self-interest,  and  then  assume  that  he  has  in  fact 
generated  a  response  appropriate  to  it.  The  earliest  version  of  this 
idea  was  used  in  signal  detectabi lity  experiments  (see  for  example.  Tanner 
-nd  Swets,  1954);  by  now  it  has  been  generalized  not  only  to  many  other 

kinds  of  perceptual  experiments  but  also  to  a  wide  variety  of  non-perceptual 
ones. 

This  approach  produces  what  might  ba  called  validation  by  cupidity. 

It  can  be  used  whenever  the  experimenter  is  willing  to  define  a  function 
on  the  product  set  of  all  stimulus  generation  procedures  used  in  the 
experiment  and  of  all  possible  responses  such  that,  given  the  procedures 
antecedent  to  an  occasion  on  which  a  response  occurs,  all  possible 
responses  that  might  occur  then  can  be  ordered  in  desirability.  (Transla¬ 
tion,  good  enough  for  most  r-rposes:  the  experimenter  knows  what  the 
correct  response  is.)  If  so,  the  experimenter  simply  rewards  the  subject 
more  for  more  desirable  than  for  less  desirable  responses.  He  assumes 
that  the  subject  wants  to  be  rewarded  (a  safe  enough  assumption),  and  that 
he  will  apply  rather  sophisticated  intellectual  tools  in  order  to  obtain 
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as  much  reward  as  possible  (some  problems  lie  here).  Since  he  finds 
it  boring  to  observe  subjects  simply  doing  what  the  stimulus  tells  them 
to  do,  he  designs  experiments  so  that  a  subject's  access  to  information 
about  the  merit  of  the  response  he  contemplates  making  Is  fallible,  and 
depends  as  sensitively  as  he  can  arrange  it  on  the  perceptual  or  other 
mechanism  he  wants  to  study.  Of  course  he  knows  that  the  subject's 
information  about  the  merits  of  his  contemplated  response  also  depends 
on  other  mechanisms,  and  that  the  response  select' on  process  depends 
on  still  others,  but  he  assumes  he  knows  fairly  well  what  those  other 
mechanisms  ^e.  By  assuming  that  the  subject  will  extract  as  much 
information  as  he  can  from  the  situation  and  the  perceptual  experience 
bearing  on  what  response  Is  best,  and  then  will  use  that  information 
optimally,  or  perhaps  only  systematically,  to  guide  his  responding, 
the  experimenter  can  infer  how  effectively  the  perceptual  mechanism  is 
providing  that  information. 

From  this  point  of  view,  the  reward  structure  of  an  experiment  serves 
four  reasonably  distinct  purposes: 

1.  Motivation:  It  encourages  the  subject  to  stay  awake,  pay 
attention,  not  goof  off,  and  take  the  experiment  seriously. 

2.  Instruction:  It  tells  the  subject  what  the  relevant  features 
of  the  experiment  are,  and  how  he  is  supposed  to  make  use  of  them. 

3.  Response  control:  It  Implicitly  specifies  for  the  subject  how 
best  to  translate  his  internal  experiences  or  processes  into  observable 


responses. 
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4.  Models  for  data  analysis:  It  permits  the  use  of  economic 
models  of  response  selec  ion  to  infer  the  internal  processes  of  interest 
to  the  experimenter. 

All  four  of  these  purposes  enter  to  some  extent  into  what  we  are 
calling  validation  by  cupidity,  but  obviously  the  last  two  are  the  crux 
cf  the  idea.  Although  much  of  this  chapter  explores  difficulties  and 
problems  connected  with  the  idea,  we  might  as  well  declare  right  now  that 
validation  by  cupidity  is  the  best  form  of  validation  available  to  psycholo¬ 
gists  who  study  internal  processes,  and  indeed  has  no  serious  competitors. 
One  consequence,  of  course,  is  that  we  see  no  way  whatever  (other  than 
gentlemen's  agreements)  to  validate  responses  such  as  magnitude  estimates, 
in  which  the  experimenter  has  no  basis  for  attaching  a  value  to  a  given 
stimulus-response  combination. 


Validation  by  cupidity  turns  perceptual  experiments  into  gambling 
experiments— or,  in  less  invidious  language,  into  experiments  on  human 
decision  making  under  uncertainty.  Naturally,  the  assumption  that  subjects 
extract  from  the  perceptual  experience  as  much  Information  as  possible 
bearing  on  the  merits  of  the  responses  available  to  him  is  a  very  strong 
one;  the  extraction  of  Information  from  complex  signals  Is  a  sophisticated 
and  demanding  process,  and  the  appropriate  mathematics  can  be  extremely 
complex.  Still,  the  approach  has  seemed  to  work.  Research  on  absolute 
and  differential  thresholds  Is  now  dominated  by  it,  and  new  applications, 
or  more  sophisticated  and  fancy  Interpretations  of  old  ones,  appear  every  day. 
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Few  users  of  these  ideas  really  believe  that  subjects  are,  for 
example,  processing  input  information  optimally  according  to  Bayes's 
theorem.  Yet  they  have  no  hesitation  to  use  Bayes's  theorem  in  analyzing 
the  results  of  human  information  processing  experiments.  Why?  Because 
it's  a  good  bet  that  any  response-selection  procedure  that  is  neither 
whimsical  nor  random  will  produce  data  indistinguishable  from  those 
produced  by  an  optimal  response-selection  procedure  based  on  somewhat 
less  information.  This  means  that  analyses  of  data  based  on  optimal 
models  will  almost  always  work,  in  the  sense  of  leading  to  sensible 
and  reproducible  results.  Typically,  those  results  cohere  fairly  well 
with  other  results. 

In  less  fancy  words:  decision  analysis  is  ipplied  to  the  results 
of  gambling-type  perceptual  experiments  because  It  works  and  it  would  be 
difficult  to  devise  any  orderly  response-selecting  mechanism  for  which 
such  analyses  would  not  work. 

In  1961.,  one  of  us  made  explicit  the  by- then-obvious  point  that 
costs  and  payoffs,  in  perceptual  or  other  experiments,  are  instructions 
(Edwards,  1961).  In  virtually  all  perceptual  experiments,  subjects  must 
choose  responses  in  a  way  that  trades  off  one  dimension  against  another— 
false  positives  against  missed  signals,  cost  of  errors  against  cost  of 
more  Information,  and  the  like.  Costs  and  payoffs  explicitly  instruct 
subjects  about  these  tradeoff  functions,  and  no  other  form  of  instruction 
does  (or,  more  carefully,  any  form  of  Instruction  that  specifies  relevant 
tradeoff  functions  is  equivalent  to  a  set  of  costs  and  payoffs). 


This  paper  picks  up  wherv  that  one  left  off.  Many  authors,  in 
many  contexts,  have  pointed  out  that  the  instructions  specified  by  costs 
and  payoffs  aren't  very  precise,  in  the  sense  that  the  decision-theoretical 
maxima  are  flat.  The  fact  of  flat  maxima  in  decision  analysis  has  been 
rediscovered,  with  dismay,  perhaps  a  dozen  times.  This  paper  rediscovers 
tnac  fact  one  more  time,  but  this  time  in  a  relatively  general  treatment. 

We  should  give  a  very  abstract  and  general  verbal  statement  of 
our  point  before  going  into  the  technical  details.  Our  point  is  this: 
Whenever  a  continuum  (or  a  dense  and  closely  spaced  set  of  discrete 
points)  enters  into  the  response  selection  process  in  an  important  way, 
whether  because  a  continuum  of  responses  is  available  or  because  response 
selection  depends  on  cutoff  points  on  an  underlying  continuum,  fairly 
substantial  changes  in  the  location  of  the  point  on  the  continuum  that 
controls  responding  (the  response  or  the  cutoff)  will  produce  extremely 
small  changes  in  the  economic  prospects  of  the  responder. 

To  put  it  another  way,  we  are  suggesting  an  addition  to  the  message 
of  Edwards's  1961  article.  Costs  and  payoffs,  though  often  the  only  feasibl 
form  of  instructions,  are  almost  always  rather  feeble;  although  they  tell 
the  subject  what  he  should  do,  they  don’t  punish  him  much  for  not  doing  it. 


IL  ^-gojts.  and  payoffs  are  feeble  instn.rtw 
A.  The  mathematical  formulation  t,ie  problem 

The  function  of  this  section  Is  to  give  technical  definitions  to  some  tones 
already  used  and  some  others  necessary  to  later  sections  of  this  paper. 

Consider  a  subject  who  does  not  care  about  the  stimuli  or  about 
the  payoff  structure  imposed  by  the  experimenter.  He  is  unmotivated,  does 
not  follow  instructions,  and  is  uninfluenced,  or  Influenced  only  capriciously 
by  variations  in  costs,  payoffs,  or  prior  probabilities.  Compare  him  with 
a  clever  and  avaricious  subject  in  the  same  experiment.  If  the  economic 
prospects  of  the  grasshopper  (to  use  a  technical  term  proposed  by  Aesop) 
are  not  considerably  worse  than  those  of  the  ant,  costs  and  payoffs  have 
not  served  their  purpose.  So  a  generally  necessary  condition  for  the 
effectiveness  of  costs  and  payoffs  in  a  given  experiment  is  that  economic 
prospects  must  change  substantially  for  different  responses.  We  must 
translate  this  condition  into  formal  language,  defining  what  we  mean 


by  "economic  prospects",  "substantial  change",  and  "different  responses"; 
to  do  the  last,  we  must  describe  responses  numerically. 

For  a  given  experiment,  a  single  number  is  associated  with  each 

response  or  set  of  responses;  its  expected  value  (EV).  We  Identify  EV 

with  economic  prospects,  and  will  hereafter  use  these  two  phrases  inter- 
changeably. 


. 
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We  should  define  EV.  To  do  so,  we  must  first  define  what  a 
stimulus  is.  Consider  Table  I,  and  think  of  it  in  the  context  of  a 
stimulus  identification  experiment  in  which  a  stimulus  is  presented  to 
the  subject,  and  he  must  say  which  of  several  possibilities  it  is. 

Insert  Table  I  about  here 

The  subject  receives  a  payoff  x..  if  stimulus  S.  has  been  presented  and 

I J  I 

his  response  was  d..  Here  and  throughout  th’*s  paper,  we  use  phrases  like 

J 

"stimulus  S.j  was  presented"  as  shorthand  for  a  more  complicated  idea. 
Technically,  is  not  a  stimulus  at  all,  but  rather  a  set  of  experimental 
procedures  designed  to  produce  cne.  So  the  accurate  but  tedious  phrasing 
would  be  "Operations  were  performed  on  or  by  the  apparatus,  producing 
a  stimulus  that  was  presented  to  the  subject."  Similarly,  an  experiment 
asking  the  subject  which  stimulus  was  presented  is  really  asking  which 
operations  on  or  by  the  stimulus-generating  apparatus  were  performed. 

Of  course  in  many  perceptual  and  other  experiments  the  stimulus  produced 
by  operations  S.  may  vary  from  instance  to  instance  within  the  experiment, 
and  may  be  only  mperfectly  known  to  the  experiiwnter. 

The  presentation  of  a  pe.rticular  stimulus  5.  will  give  the  subject 
information,  perhaps  fallible,  about  which  stimulus  actually  was  presented. 
, After  the  occurrence  of  S.,,  the  subject  will  have  a  personal  probability 
distribution  over  the  possible  stimuli;  that  distribution  associates  with 
each  S,.  a  probability  The  expected  value  to  the  subject  of  making 
response  d^  is  then  defined  as 


EV  (d.)  =  E  £•  x«. 
J  -j  1  1 J 


(1) 
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For  stimuli  that  vary  continuously,  integration  replaces  summation 
in  this  definition. 

We  have  no  reluctance  about  Identifying  the  economic  prospects 
of  response  dj  with  EV(dj),  thus  implying  that  subjects  should  select  a 
response-generating  procedure  that  maximizes  EV.  An  antique  fallacy 
questions  this  recommendation,  arguing  that  EV  maximization  is  a  wise 
Strategy  only  for  repeated  events.  As  we  see  it,  the  very  word  "strategy" 
implies  that  some  principle  of  response  selection  will  be  repeatedly 
applied;  the  arguments  that  make  EV  maximization  optimal  don't  care 
whether  the  conditions  are  or  are  not  constant  from  one  application  of 
the  strategy  to  the  next.  Besides,  most  perceptual  experiments  repeat 
the  same  condition  often  enough  so  that  the  most  passionate  relative- 
frequeritist  would  agree  that  EV  maximization  is  wise. 

Though  we  unhesitatingly  Identify  the  subject's  economic  prospects 
with  EV,  the  subject  may  not.  Whether  men  in  far;  maximize  EV  or  not  is  a 
profound,  difficult  question;  to  examine  it  here  would  take  us  too  far 
afield.  A  great  deal  of  experimentation  has  produced  no  evidence  against 
that  hypothesis;  data  analyses  based  on  it  abound  in  perceptual  experiments, 
mostly  because  they  work.  Moreover,  any  other  consistent  decision  rule 
that  permits  all  possible  economic  outcomes  of  dj  to  enter  into  assessment 
of  its  worth  with  appropriate  sign  and  with  some  monotonio  variation  of 
the  economic  consequence:  of  with  the  magnitude  of  each  pavoff  will, 
for  virtually  any  theoretical  or  practical  purpose,  be  indistinguishable 
from  EV  maximization.  (Note  that  we  are  using  the  notion  of  value  to 
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include  subjective  values  as  well  as  dollar  payoffs;  our  arithmetic 
will  be  done  on  dollars,  but  the  nature  of  our  results  will  imply  that 
it  makes  little  difference  whether  or  not  the  effective  payoff  to  the 
subject  is  or  is  not  linear  with  its  dollar  value,  so  long  as  it  is  strictly 
monotonic) . 

Next  we  must  give  an  account  of  perceptual  experiments  that  permits 
us  to  define  responses  and  the  strategies  that  produce  them.  Our  account, 
naturally,  will  be  decision-theoretical  in  spirit.  Here  and  throughout,  we 
shall  speak  of  both  stimulus-generating  operations  and  responses  as  being 
chosen  fror<  mutually  exclusive,  discrete,  finite  sets.  This  mild  idealization 
slightly  simplifies  the  mathematics  and  greatly  simplifies  the  language. 

Anyone  mathematically  demanding  enough  to  be  bothered  by  it  will  also  be 
mathematically  skilled  enough  to  see  the  easy  generalizations  of  the 
arguments  to  continuous  cases.  Following  the  current  conventions  of 
perceptual  research,  we  shall  assume  that  a  discrete  stimulus-generating 
operation  may  lead  to  any  member  of  a  continuously  distributed  set  of 
alternative  stimuli;  as  we  said  above,  "stimulus  S1  was  presented"  refers 
tc  the  discrete  operation  ,  not  to  the  actual  stimulus. 

We  suppose  that  before  S.  is  presented,  the  subject  has  a  prior 
probability  vector  over  the  set  of  possible  stimuli.  Upon  observing  (the 
stimulus  resulting  from  operation)  S.,,  he  transforms  that  vector  into  a 
posterior  probability  vector.  The  correct  rule  of  transformation,  of  course, 
is  Bayes’s  thee  im.  While  we  shall  use  that  where  relevant,  we  do  not  need 
to  assume  that  the  subject  uses  it;  the  weaker  assumption  that  the  subject 
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represents  his  observation  internally  as  a  likelihood  ratio,  or  some 
quantity  monotonical ly  related  to  likelihood  ratio,  and  that  he  takes 
prior  probabilities  into  account  in  response  selection  is  sufficient. 

Now  the  subject  makes  the  response  from  among  those  available  to  him  that 
has  the  best  economic  prospects.  He  may  or  may  not  receive  feedback.  In  a 
single-observation  experiment,  that  terminates  the  trial.  In  a  multiple- 
observation  experiment  he  may  choose  or  may  be  required  to  make  more  observa¬ 
tions.  All  experiments  end  each  trial  with  selection  of  a  terminal  response, 
perhaps  followed  by  feedback;  some  require  responses  (other  than  decisions 
to  look  at  more  information)  interspersed  with  the  sequence  of  observations. 

A  decision  rule  is  a  cutoff  vector  or  set  of  cutoff  vectors  defined 
in  the  set  of  vectors  of  posterior  probabilities.  It  specifies  which 
response  will  be  selected  given  any  such  vector.  In  a  twc-a' ternative  stimulus 
identification  task  with  a  symmetric  payoff  matrix,  for  example,  the  cutoff 
vector  would  probably  be  (2*2) •  The  subject  selects  response  d..  whenever 
his  personal  probability  that  was  presented  exceeds  £.  For  an  optimal 
subject,  or  one  whose  data  are  being  analyzed  as  though  he  were  optimal, 
choice  of  decision  rule  depends  only  on  the  payoff  structure  of  the  experi¬ 
ment,  and  so  can  be  known  before  5^  is  presented. 

In  most  experiments,  decisions  depend  not  only  on  payoffs  and  on 
observations,  but  also  on  prior  probabilities.  Information  about  the  prior 
probabilities  can  be  combined  with  Information  about  the  payoffs,  permitting 
the  decision  rule  to  be  redefined  (via  Bayes's  theorem)  from  the  set  of 
vectors  of  posterior  probabilities  to  a  set  of  vectors  of  likelihood  ratios, 
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and  sometimes  further  from  the  likelihood  ratios  to  a  set  of  physical 
characteristics  of  the  actual  stimulus,  and  occasionally  further  from  ohe 
actual  stimulus  to  the  stimulus-generating  operations  S1 .  Any  such 
redefinition  of  a  decision  rule  out  of  the  set  of  vectors  of  posterior 
probabilities  into  some  other  set  more  directly  related  to  the  stimulus 
presentation  conditions  of  the  experiment  we  shall  call  a  decision  function. 

A  decision  function  partitions  the  set  of  possible  observations  by  means 
of  criterion  or  cutoff  points.  In  signal  detection  theory,  for  example, 
the  likelihood  ratio  criterion  8  partitions  the  set  of  actual  observations 
(insofar  as  that  set  can  appropriately  be  mapped  into  a  set  of  likelihood 
ratios).  Within  each  class  of  observations,  the  same  response  is  appropriate. 
Obviously  any  translation  from  a  decision  rule  to  a  decision  function  depends 
on  a  model  of  the  sequence  of  processes  beginning  with  the  stimulus-generating 
operations  and  ending  with  the  internal  process  representing  tie  stimulus, 
and  is  no  more  trustworthy  than  that  model.  The  arguments  of  this  paper 
complicate  that  point  by  showing  that  the  inference  from  responses  to 
parameters  of  such  models  is  quite  weak;  whether  that  is  good  or  bad  depends 

on  the  purpose  for  which  the  model  is  being  used. 

In  some  experiments,  such  as  those  concerned  with  probability  estima¬ 
tion,  the  response  categories  available  to  the  subject  are  continuous,  or 
more  often  numerous,  ordered,  and  densely  spaced.  In  others,  such  as 
stimulus  identification  experiments,  the  set  of  available  response  categories 
will  typically  be  sparse.  However,  the  concepts  of  decision  rules  and 
decision  functions  permit  us  to  think  of  the  response-selecting  process 


as  either  continuous  or  as  having  numerous  ordered  and  densely  spaced 
alternatives  in  such  experiments  also.  So,  in  order  to  examine  the 
specificity  with  which  costs  and  payoffs  control  responding,  we  can 
simply  plot  the  expected  value  of  the  response,  decision  rule,  or  decision 
function,  as  appropriate,  against  a  numerical  representation  of  that 
response,  decision  rule,  or  decision  function.  Our  main  interest,  of 
course,  is  in  the  shape  of  this  function  around  its  maximum,  since  we 

assume  that  subjects,  motivated  by  cupidity,  try  to  sel-.t  their  responses 
in  a  fairly  optimal  way. 


The  thesis  of  this  paper  can  now  be  more  exactly  stated:  the 

expectec  value  of  responses,  decision  rules,  or  decision  functions  changes 

only  slightly  with  large  deviations  from  optimal  values.  Consequently  the 

economic  prospects  of  the  grasshopper  nay  be  only  slightly  worse  than  those 

of  the  ant;  economic  prospects  often  do  not  change  substantially  for  different 
response  selection  rules. 


B.  Expected  value  as  a  function  of  decisions 

Consider  a  recognition  task  in  which  the  subject  must  estimate  on 
a  0- to- 100  scale  the  probability  that  the  current  stimulus  is  old;  that  is, 
has  been  presented  before.  The  experiment  has  two  stimulus-generating 
conditions,  old  and  new,  a  dense,  orderly  set  of  available  responses,  and 
a  payoff  defined  for  each  stimulus-response  combination.  Let  5,  be  the 

response  the  subject  choose  to  make  on  a  given  trial.  Then  x  .  is  the 

0  >  J 
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payoff  for  it  if  the  stimulus  is  old,  and  .x  .  is  the  payoff  for  it 

n  t  j 

if  the  stimulus  is  new.  Let  be  the  subject's  actual  personal  probability 
on  this  trial  chat  the  stimulus  is  old;  of  course,  nothing  guarantees  that 
^j  =  ^t'  su^ect  can  evaluae  any  response  by  its  expected  value; 

EV  =  5t  +  xn,J  -Ct).  (2) 

For  each  value  of  £t.  a  value  of  £,  wil  l  exist  for  which  EV  (.£. ) 
will  be  maximized.  This  maximal  expected  value  EV*  will  be  a  convex 
function  of  .  Often,  the  experimenter  will  try  to  encourage  the  subject 
to  report  his  true  opinions  by  using  a  function  to  specify  the  x's  such  that 
EV  is  maximized  whenever  Such  functions  are  called  proper  scoring 

rules;  for  leads  into  the  extensive  literature  about  them,  see  Aczel  and 
Pfanzagl  (1966),  Murphy  and  Winkler  (1970),  or  Savage  (1971). 

A  typical  function  plotting  maximum  expected  value  giyen  optimal 
choice  of  ,  EV  ,  against  ^  is  shown  in  Figure  1.  It  represents  the 
quadratic  scoring  rule,  one  of  the  two  most  frequently  used  proper  scoring 

Insert  Figure  1  about  here 

rules,  where  xpj  =  1  -  (1  -  xnj  =  1  -  The  two  lines  show 

the  EVs  of  responses  ^  and  £2  as  a  function  of  £t.  Of  course  ^  would 
be  the  optimal  action  if  the  subject's  actual  probability  were  ^ ,  and 
similarly  for  this  follows  from  the  definition  of  proper  scoring  rules. 
Now,  assume  that  the  subject's  actual  probability  is  ,  but  that 
he  nevertheless  chooses  as  his  response. 
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His  economic  prospects  are  diminished  by  the  difference  between  the 
Evs  of  C-|  and  ^  Qi’ven  actual  probability  gj*.  in  Figure  1  that  difference 
is  labelled  A.  It  will  typically  be  very  small  in  relation  to  the  total 
EV  if  the  difference  between  actual  and  optimal  response  is  not  too 
large,  especially  if  the  optimal  EV  function  is  itself  rather  flat  in  the 
region  of  £. .  The  nearer  is  to  the  minimum  of  the  function,  that  is, 
the  more  uncertain  a  subject  is  about  what  response  is  best,  the  less  he 
v/ill  suffer  as  a  consequence  of  suboptimal  decisions. 

The  fact  that  proper  scoring  rules  have  this  undesirable  property 
of  flatness  is  well  known  (see,  for  example,  Murphy  and  Winkler,  1970). 
However,  the  ubiquity  of  proper  scoring  rules  is  less  well  knuwn.  First, 
notice  that  the  label  on  the  response  is  irrelevant;  proper  scoring  rules 
are  not  confined  to  situations  in  which  the  response  is  an  explicit 
probability  estimate.  A  useful  distinction  can  be  made  between  conditions 
that  must  be  satisfied  to  generate  a  proper  scoring  rule  and  conditions  that 
must  be  satisfied  just  to  recognize  one.  It  is  easy  to  recognize  a  proper 
scoring  rule:  any  list  of  acts  that  includes  none  that  are  dominated, 
stochastically  dominated,  or  duplicated  is  based  on  ?  payoff  matrix  that 
is  an  extract  from  a  proper  scoring  rr'.e.  That  sentence  sounds  fancier  than 
it  is.  All  it  means  is  that  if  each  act  has  the  property  that  some  set  of 
vectors  of  probabilities  makes  it  optimal,  then  choice  of  that  act  in  effect 
signals  estimation  of  a  vector  within  that  set.  Any  response  is  a  probability 
estimate.  Moreover,  the  linearity  of  the  equation  for  EV  guarantees  that  if 
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an  act  is  optimal  given  more  than  one  vector  of  probabilities,  then 
all  vectors  for  which  it  is  optimal  will  be  adjacent  to  one  another. 

That  word  "adjacent"  is  here  used  in  the  rather  special  sense  that  all 
these  adjacent  vectors  will  fail  within  a  closed  convex  region  of  a  regular 
hypertetrahedron  of  dimensionality  one  less  than  the  number  of  probabilities 
ir.  the  vector. 

(The  usual  sets  of  conditions  that  define  proper  scoring  rules  are 
more  complicated,  because  they  typically  are  designed  to  ensure  that 
different  vectors  produce  different,  scores.  But  such  issues  are  irrelevant 
here;  we  are  only  interested  in  the  fact  that  different  acts  identify 
different  sets  of  vectors.) 

This  argument  for  the  ubiquity  of  scoring  riles  means  little  for 
payoff  matrices  in  which  the  number  of  acts  approximates  the  number  of  states. 
But  for  payoff  matrices  in  which  the  number  of  acts  is  very  much  larger 
than  the  number  of  states,  and  yet  no  acts  are  dominated  or  stochastically 
dominated,  the  argument  for  flat  maxima  given  above  become  increasingly 
applicable.  If  the  act  space  is  continuous  or  acts  are  densely  distributed 
over  a  continuum,  the  argument  for  flat  maxima  applies  with  full  orce. 

C.  Expected  value  as  a  function  of  decision  rules 

So  far  we  have  discussed  reductions  in  EV  that  result  from  a  single 
nonoptimal  act.  Now  we  wi1!  examine  the  effect  of  consistently  applied 
nonoptimal  decision  rules.  Consider  a  task  in  which  the  subject  must 
discriminate  two  objects  according  to  brightness.  He  can  make  either  of 
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the  following  two  responses: 

=  Stimulus  1  is  brighter  than  stimulus  2 
d«  =  Stimulus  2  is  brighter  than  stimulus  1 

Correspondingly,  on  each  trial,  there  are  two  possible  states  of  nature: 
either  S-j  or  j2  1S  physically  brighter.  The  experimenter  defines  a  payoff 
matrix  as  in  Table  II,  where  a  and  d  can  be  thought  of  as  payoffs,  b  and 
c  as  costs.  We  assume  that  the  subject  must  perform  this  discrimination 

Insert  Table  II  about  here 

task  repeatedly  for  different  pairs  of  stimuli.  After  presentation  of  a 
pair  of  stimuli,  the  subject  will  have  some  probability  distribution  over 
the  two  states  of  nature: 

Pr(S1)  =  5,  Pr  (S2)  =  1  -  K 

The  optimal  strategy  for  such  a  task  ca*  be  derived  easily.  The  subject 
should  choose  d.j ,  whenever 

EV  (d.j)  >  EV  (d2)«  d2  otherwise.  I.  e.,  he  should  choose  d^  if 

5  a  +  (1-e)  c  >  £b  +  (l-C)d  (3) 

or 

S  >  (d-c)  /  [a  +  d  -  b  -  c]  =  p*  (4) 

This  result  is  represented  in  Figure  2. 
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The  EVs  of  the  two  decisions  are  linear  functions  of  £  and  their 
intersection  defines  the  cutoff  point  p*.  Therefore  optimal  strategy 
in  this  problem  is  totally  determined  by  p*.  Now  assume  that  the  subject 
in  fact  adopts  a  strategy  p  f  p*.  i.e.,  he  chooses  decision  1  whenever 

Insert  Figure  2  about  here 

£  >  p  and  decision  2  whenever  £  <  p.  How  will  the  EV  of  this  decision 
rule  compare  with  the  EV  of  the  optimal  strategy?  It  depends  on  the  subject's 
prior  opinion  about  the  posterior  distribution  of  £  over  trials.  Assume  for  a 
moment  that  tne  subject  considers  all  values  of  £  to  be  equally  likely.  The 
EV  can  be  expressed  in  terms  of  the  cutoff  point  p. 

EV ( p )  =  r  EV(d1)  d£  +  /P  EV (dp )  d£  = 

C«P  £=o 

-  1/2  p^  [a  +  d  -  b  -  c]  +  p[d  -  cj  +  1/2  [a  +  c]  (5) 

which  Is  a  quadratic  function  of  p,  whose  parameters  are  determined  by 
the  costs  and  payoffs  involved.  Its  first  derivative  is 

EV'(p)  =  -p  [a  +  d  -  b  -  c]  +  [d  -  c]  (6) 

From  this  it  follows  that  the  maximum  EV  is  obtained  when  we  set 

P  =  p*  =  (d  -  c)  /  [a  +  d  -  b  -  c]  (7) 


which  we  saw  in  Eq.  (4). 
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The  first  derivative  of  the  function  EV(p)  tells  us  how  steep 
EV(p)  is  around  the  optimal  value  of  p*.  Equation  (6)  shows  that  the 
steepness  of  E V ( p )  depends  on  the  values  of  all  payoffs.  The  larger  the 
costs  and  payoffs,  the  sticper  this  function  will  be— which  is  not  a 
surprise.  Two  examples  with  values  typically  used  in  psychological 
experiments  will  give  an  idea  of  how  flat  this  function  will  be  in  n.ost 
situations.  The  results  for  values  of  a  =  d  =  +1<£,  b  =  c  =  0$  (1)  and 
of  a  =  1 ,5<t,  b  =  c  =  -.5tf,  d  =  +  .5<t  (2)  can  be  seen  in  Figure  3. 

Insert  Figure  3  about  here 

A  more  intuitive  way  of  looking  at  the  expected  value  as  a  function 
of  a  response  strategy  p  results  from  an  inspection  of  the  areas  in  Figure 
2.  The  area  under  the  two  heavy  lines  defines  the  average  win  under  an 
optimal  strategy  p*.  The  shaded  area  represents  the  average  loss  due  to 
a  suboptimal  strategy  p.  Graphs  like  this  can  help  the  experimenter  to 
gain  insight  into  the  effectiveness  of  his  payoff  structure. 

Mathematically  the  description  of  the  expected  value  of  a  strategy 
p  as  a  function  of  p  and  the  outcome  structure  is  very  convenient.  It  is 
easy  to  see  that  by  multiplying  all  outcomes  by  a  factor  greater  than  1, 
the  experimenter  can  steepen  this  function  as  much  as  he  wishes.  However, 
Increasing  the  steepness  of  the  EV(p)  function  may  not  change  the  relative 
loss  of  any  noyi-optimal  strategy,  expressed  as  a  percentage  of  the  optimal 
EV.  Consider  the  payoff  matrix  displayed  in  table  II  and  the  resulting 
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EV  function  over  strategies  p.  If  we  multiply  all  outcomes  by  a 
constant  g  >  1,  we  will  get 

EVg(p)  “  g  EV(p)  (8) 

which  is  steeper  than  the  original  EV  function  Now  consider  the 
relative  quantity 

REL  (p)  =  100  [ EV ( p* )  -  EV(p)]  /  EV(p*)  (9) 

where  *EL  stands  for  relative  expected  loss  and  is  measured  in  percent 
(see  Edwards,  1956).  If  REL  is  the  value  with  which  subjects  ere  concerned, 
multiplying  all  outcomes  by  a  constant  does  not  affect  the  motivational 
effect  of  the  costs  and  payoffs,  since 

REL  (p)  =  RELg  (p)  00) 

The  experimenter  will  often  be  concerned  with  manipulating  the  REL  function 
rather  than  the  expected  value  function.  This  is  mathematically  simple 
but  unpleasant  computationally.  Let 


U  =  [a+d-b-c]  (Ha) 

V  =  [d  -  c]  0lb) 

U  =  [a  +  c]  0lc) 

From  this  it  follows  that 

p*  =  V/U  O2) 

EV(p*)  =  1/2  [w  +  V2  /  U],  03) 


REL(p)  =  100  [1  +  U2  p2  /  (UW  +  V2)  -  2UVp  /  (UW  +  V2)  - 

-  UW  /  (UW  +  V2)]. 


(14) 


In  the  foregoing  two  examples  we  find  that  the  two  REL  functions  are 
similar  in  flatness  to  the  E V  functions  (Figure  4).  For  example,  for 

Insert  Figure  4  about  here 

the  symmetric  (first)  payoff  matrix,  the  subject  would  not  lose  more  than 
8%  of  the  expected  value  of  the  optimal  strategy  p*  =  \  for  any  value 
of  p  betweh.i  i/4  and  3/4.  Similar  analyses  are  mathematically  fairly 
easy  to  develop  for  the  more  general  case  of  n  decisions  and  m  states 
but  they  lack  the  simple  graphical  interpretation  displayed  here— unless 
you  happen  to  be  good  at  visualizing  convex  regions  within  regular 
hypertetrahedrons. 

REL  functions  like  those  shown  in  Figure  4  are  extremely  useful 
ways  of  examining  the  properties  of  payoff  schemes,  but  they  need  careful 
interpretation.  This  part  of  the  paper  will  consider  only  symmetric  2x2 
payoff  matrices  and  prior  odds  of  1:1;  similar  but  somewhat  more  complicated 
arguments  apply  to  more  complicated  cases.  In  symmetric  cases,  the  subject 
can  guarantee  that  he  will  be  right  half  the  time  simply  by  flipping  a  coin 
Any  non-perverse strategy  must  be  at  least  as  good  as  that.  So  the  maximum 
feasible  REL  is  defined  as  1  -  (l/2pc),  where  p£  is  the  probability  of 
being  correct  if  the  subject  uses  the  optimal  strategy.  If  p  is  0.6, 
the  maximum  feasible  REL  is  1/6;  if  p  is  0.9,  it  is  4/9. 

V 

From  the  subject's  point  of  view,  the  REL  expresses  how  much 
difference  in  economic  prospects  exists  between  an  optimal  ant  and  a 
feckless  grasshopper,  for  various  degrees  and  kinds  of  fecklessness. 


If  a  grasshopper  can  earn  5/6  as  much  as  an  ant  does  without  even 
noticing  the  stimulus,  he  doesn't  have  much  ir.oentive  to  notice  it, 

much  less  to  think  about  its  meaning. 

From  a  somewhat  different  p  int.  of  view,  tne  distance  between  the 

optimal  REL  of  0  and  the  maximum  possible  REL  df vines  the  range  of  degrees 
of  success  the  experimenter  can  have  in  inducing  his  subject  to  perform  in 
an  ant-like  rather  than  a  grasshopper-like  way,  for  fixed  stimuli,  responses, 
and  payoffs.  The  nearer  the  subject  gets  to  0  REL,  the  more  successfully 
the  training  and  economic  pressures  are  combining  to  produce  art-like 
behavior.  From  this  point  of  view,  the  availablt  R'l  range,  numerically 
equal  to  the  maximum  feasible  REL  defined  above,  r«<*; ght  be  taken  as  100, 
and  the  percentage  deviation  of  REL  from  0  on  this  transformed  scale  might 

be  taken  as  an  index  of  fecklessness. 

Of  course  REL  is  calculable  only  if  pc,  the  probability  of  correct 

response  if  the  subject  uses  the  optimal  strategy,  is  known.  Often,  it 
won't  be. 

In  cases  using  asymmetric  payoff  matrices,  unless  they  are  balanced 
by  prior  odds  asymmetric  in  the  other  direction,  the  subject  can  gain  much 
more  than  half  of  the  available  payoff  by  a  stimulus-ignoring  strategy: 
simply  pick  the  act  that  has  the  larger  sum  of  payoffs  over  states.  If 
the  asymmetric  payoffs  are  exactly  balanced  by  priors  asymmetric  the  other 
way,  the  situation  is  reduced  to  the  symmetric  case  discussed  above.  And 
if  the  priors  are  even  more  asymmetric  than  the  payoffs,  once  again  the 


subject  can  gain  much  more  than  half  the  available  payoff  by  a 
stimulus-ignoring  strategy.  So  the  general  tendency  of  such  asymmetric 
arrangements  is  to  reduce  the  advantage  of  the  ant  over  the  grasshopper- 
provided  that  the  grasshopper  at  lear,c  notices  and  exploits  the  payoffs 
and  prior  probabilities. 

Nonuniform  distributions  over  E,  will  change  the  foregoing  analysis, 
but  as  long  as  values  of  £  close  to  p*  or  far  away  from  it  are  most  probable 
nonuniformity  will  only  increase  the  flatness  of  the  expected  value  or  the 
relative  expected  loss  function.  In  other  words,  subjects  will  suffer  less 
f>*om  using  nonoptimal  strategies  if  their  task  is  very  difficult  or  very 
easy  than  if  it  is  of  moderate  difficulty. 

For  tasks  of  moderate  or  severe  difficulty,  the  subject's  prior 
expectations  cbout  the  relative  frequency  of  the  various  stimulus  conditions 
bear  importantly  on  posterior  values  of  5.  During  the  early  trials  of  an 
e>  sriment,  the  subject's  distributions  over  £  may  be  fairly  uniform.  This 
lies  that  he  will  face  EV  and  REL  functions  lile  those  in  Figure  3  and  4- 
“  ely  steep.  As  a  result  of  his  experience  over  a  sequence  of  trials, 
hi.  -  and  consequently  his  posterior  distribution  over  £  on  each  trial 
will  oen  considerably  around  the  values  implied  by  the  experimenter's 
stimu  presentation  frequencies  and  the  difficulty  of  the  task.  The 
experi  iter  typically  dees  not  want  the  subject  to  find  the  task  easy, 
so  these  'alues  are  likely  to  lie  in  the  region  between  1/4  and  3/4.  An 
experimental  procedure  that  leads  to  values  of  £  in  this  range  will  flatten 
out  the  EV  and  REL  functions  considerably  over  those  of  Figure  3  and  4.  The 


effect  will  be  especially  marked  if  only  a  few  distinct  stimulus  rela¬ 
tive  frequencies  and  task  difficulties  are  used. 

D.  Expected  yalue  as  a  function  of  decision  functions 

In  decision  problems  with  observations,  flat  maxima  appear  when 
we  plot  EV  against  decision  functions,  or,  equivalently,  against  decision 
criteria.  Consider  a  simple  auditory  detection  task  (a  yes-no  task)  in 
which  a  pure  tone  may  or  may  not  be  embedded  in  white  noise  and  assume 
a  general  signal  detection  model  (see  for  example  Green  and  Swets,  1966). 

The  subject  must  perform  a  decision  task  based  on  a  single  observation. 
According  to  the  general  signal  detection  model,  the  subject's  observation 
is  a  random  variable  generated  either  from  a  signal  distribution  f(y|S) 
or  from  a  noise  distribution  f(y|N).  The  subject  Is  assumed  to  appl^  some 
likelihood  ratio  criterion  in  order  to  decide  whether  he  should  choose  act 
1  (observation  was  generated  by  the  noise  distribution)  or  act  2  (observation 
was  generated  by  the  signal  distribution).  Assuming  that  k  is  monotone 
with  6,  a  likelihood  ratio  criterion  6  generates  the  following  decision 
function  6: 

5  :  Y  •+  D  =  {d-j ,  dg)  with 

<$(y)  =  d.|  whenever  y  <  k 

<5{y)  =  dg  whenever  y  ^  k 


where  k  is  the  solution  to 
f(k|S)  /  f (k|N)  =  6 


(15) 
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The  general  formula  for  the  EV  of  such  a  decision  function  for  a 
continuous  state  and  decision  space  is 


EV(6)  =  /  /  V(s,fi(y))  f(yjS)  ijj(s)  ds  dy 
S  Y 


(16) 


where  V(s,  6(y))  is  the  outcome  associated  with  a  particular  state  value 
s  and  the  decision  defined  by  $(y)  .  ^(s)  Is  the  prior  distribution  over 
the  states.  Solving  for  the  particular  2x2  situations  of  this  signal 

detection  task,  we  obtain  for  a  payoff  matrix  like  the  one  In  Table  II 
and  prior  ^i(N)  =  £ 


EV(5)  =  Pr  (Y  <  k | N)  £(a  -  b)  +  b£  +  Pr  (Y  <  k|S)  (1  -  g)  (c  -  d) 
+  d  ( 1  -  0 


(17) 


Since  this  expectation  Is  solely  a  function  of  k,  we  can  write 
E V(6)  =  EV ( k)  =  EV(fi) 

From  first  derivative  of  this  function  it  can  easily  be  shown 
that  EV(k)  is  maximized  for  k  =  k*,  where  k*  Is  the  solution  to 


(18) 


f(k*|S)/f(k*|N)  =  [(1  -  0/5]  [d  -  c)/(a  -  b)]  =  e* 

This  is,  of  course,  a  familiar  result  In  signal  detection  theory,  e* 

Is  the  optimal  likelihood  ratio  criterion  specified  by  the  payoffs  and 
prior  probabilities  of  the  experimental  task.  How  EV  changes  as  a  function 
of  changes  in  k  or  B  depends  on  the  conditional  distribution  of  Y.  Specific 
results  can  be  derived  only  If  we  restrict  ourselves  to  particular  distribu¬ 
tions.  However,  the  expected  value  of  extreme  policies  can  easily  be 
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derived: 


EV(k  =  +<*>)  =  &K  +  (1  -  £)c  (19) 

EV(k  =  —)  =  be  +  (1  -  e)d  (20) 

Thus,  as  a  first  check,  without  assuming  any  specific  distribution,  the 
experimenter  can  analyze  the  maximum  differences  in  expected  value  generated 
by  different  response  strategies  (excluding,  of  course,  a  diabolical  subject 
who  would  use  an  optimal  decision  criterion  but  reverse  his  decisions.) 

Next,  consider 

EV(k*)  -  EV(k)  =  [/k*  f  (Y|N)  dY  -  (5*  f(Y|s)  dY]  E(a  -  b). 
k  k 

(21) 

the  loss  in  EV  caused  by  a  nonoptimal  strategy  k  (without  loss  of  generality 
we  assumed  that  k  <  k*).  Again  without  assuming  any  distributions,  some  major 
implications  can  be  drawn,  which  are  illustrated  in  Figure  5.  First,  note  that 
the  difference  in  EV  will  be  linear  in  the  prior  probability  and  the  payoffs. 

Insert  Figure  5  about  here 

For  extreme  values  of  £  or  for  large  costs  and  payoffs  the  loss  defined 
by  Eq.  (21)  v/Hl  increase.  Second,  assume  that  both  conditional  distributions 
have  a  small  amount  of  overlap.  Then  the  expression  in  the  brackets  of  Eq.  (21) 
will  typically  be  very  small,  since  the  integrals  are  represented  by  the  areas 
shaded  In  Figure  5  around  the  optimal  cutoff  point  k*.  In  other  words,  a 
large  d'  would  therefore  mean  that  suboptlmal  values  of  a  would  result  in 
little  less  EV  than  would  be  produced  by  a*.  Similar  conclusions  can  be 
drawn  for  a  large  amount  cf  overlap,  i.e.,  a  small  d'.  If  a*  is  near  1 
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the  expression  in  the  brackets  of  Eq.  (21)  will  become  small,  since  both 
integrals  will  have  nearly  equal  values.  If  the  separation  is  not  extreme  and 
if  k*  Is  shifted  in  the  direction  of  the  mean  of  either  distribution  by 
changing  prior  probabilities  or  payoffs,  one  of  the  two  integrals  in  Eq.  (21) 
will  usually  become  very  small,  while  the  other  increases.  The  relative 
change  in  expected  value,  however,  will  be  smaller  than  £(a  -  b).  This 
intuitive  analysis  permits  the  experimenter  to  visualize  the  effects  of  cost 
and  payoffs.  Values  of  £  near  1/2  and  quite  large  or  quite  small  separations 
of  the  conditional  distributions  tend  to  produce  flat  expected  value  and 
relative  expected  loss  functions.  In  Figure  6  the  expected  value  is  plotted 

Insert  Figure  6  about  here 

as  a  function  of  k  and  d'.  Here  we  made  the  assumption  that  both  conditional 
distributions  are  normal,  (see  also  Chinnis,  1971).  The  results  are  plotted 
for  the  symmetric  payoff  matrix.  Figure  7  shows  the  relative  expected  loss 
version  of  the  same  Information.  It  highlights  the  fact  that  intermediate 
values  of  d'  produce  most  steepness.  You  can  see  that  for  all  levels  of  d' 

Insert  Figure  7  about  here 

a  difference  of  +  1  standard  deviation  between  the  optimal  and  the  actual 
decision  criterion  would  result  in  a  loss  of  only  about  10%  or  less.  The 
equation  relating  B»  k,  and  d'  is 

In  6  =  k  d' .  (22) 

In  other  words  changes  in  k  are  linearly  related  to  changes  in  In  B.  and 
the  larger  the  value  of  d'  the  larger  is  the  change  in  In  B  associated  with 
a  give’.i  change  in  k. 
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Other  decision  functions,  the  so-called  stopping  rules,  can 
represent  response  sets  whenever  stimuli  from  the  same  stimulus-generating 
procedure  are  presented  in  sequence  and  the  subject  can  decide  after  seeing 
each  stimulus  (or  perhaps  less  frequently)  If  he  wants  to  make  a  response 
or  if  the  stimulus  presentation  should  be  continued,  rarhaps  at  a  cost. 
Rapaport  and  Burkhelmer  (1970,  1971)  showed  that  EV-functlons  are  also 
flat  over  stopping  rules.  DeGroot  (1970)  argues  that  costs  and  payoffs 
for  final  responses  have  much  less  Influence  on  stopping  rules  than  the 
cost  of  observations,  if  the  latter  cost  Is  relatively  small.  The  flatness 
of  the  EV-function  over  stopping  rules  is  important  to  studies  of  multiple 
observations  In  signal  detection  theory,  to  studies  of  choice  reaction 
time,  and  to  studies  of  Information-purchasing  decisions. 

E.  Expected  value  as  a  function  of  criterion  variability. 

So  far  we  assumed  that  the  subject's  responses  were  generated  by  a  systematic 
procedure,  such  as  the  use  of  fixed  cutoff  points.  How  will  the  subject's 
economic  prospects  change,  If  he  varies  his  decision  rule  or  decision  function 
randomly? 

As  an  example,  we  assume  that  a  subject  varies  his  p  criterion  in 
a  simple  decision  task  randomly  according  to  some  uniform  distribution 
between  values  pQ  and  p°,  both  equally  far  from  the  optimal  criterion  p*. 

We  know  from  the  previous  discussion  about  expected  value  functions  over 
decision  rules  that 


EV(p)  =  -l/2p2  U  +  p  V  +  1/2  W 


(see  equation  5) 
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Since  p  has  a  uniform  distribution,  we  can  infer  the  following 
expectation: 

E  (EV(p) )  •=  /  tEV(p)  /  (p°  -  pj]  dp  (23) 

Pp 

Figu^a  8  shows  how  this  expectation  varies  as  a  function  of  p°  for  the 
two  payoff  matrices  used  in  Figure  3.  Note  that  for  the  symmetric 
payoff  matrix  (1),  a  subject  will  not  lose  more  than  12%  of  the  maximal 
EV,  if  he  should  choose  a  decision  rule  at  random  from  the  interval  [0,  1] 
at  each  trial.  Of  course  this  expectation  will  always  be  larger  than  the  EV 
of  pQ  and  p  and  smaller  than  the  EV  of  p  .  Thus  the  flatness  of  the 
expected  value  function  as  a  function  of  p  determines  the  flatness  of  the 
expectation  considered  here.  Furthermore,  the  assumption  of  a  uniform 
distribution  makes  this  expectation  artificially  small.  In  most  situations, 
we  might  expect  the  subject  to  vary  his  response  criterion  according  to  some 
bell  shaped  function  around  a  criterion  p,  which  might  even  be  the  optimal 
criterion  p  .  This  in  turn  will  increase  the  expectation  substantially— 
that  is,  it  will  make  the  EV  of  the  random  strategy  even  closer  to  the 
EV  of  the  optimal  strategy  p*. 

It  is  well  known  that  criterion  variability,  within  any  signal 
detection  model,  will  result  in  lowered  estimates  of  d‘  (or  equivalent 
quantities),  but  the  magnitude  of  the  effect  is  not  so  well  known. 
Unfortunately,  the  function  that  relates  criterion  variability  to  decrease 
in  d  is  not  at  all  flat;  it  is  virtually  linear  over  interesting  parts 
of  its  range. 


Table  III  is  based  on  our  standard  symmetric  payoff  matrix,  and 
assumes  two  normal  distributions  differing  only  in  mean.  It  shows  the 

Insert  Table  III  about  here 

effect  of  various  amounts  of  criterion  variability  at  various  levels  of 
true  d1,  along  with  the  economic  consequences,  in  EV  and  REL,  of  this  kind 
of  suboptimal  behavior.  It  matters  very  little  what  the  form  of  the 
criterion  variation  function  is,  so  long  as  its  mean  is  at  k*  and  its 
standard  deviation  is  the  one  listed  in  the  table.  The  computations  in 
Table  III  assumed  normally  distributed  variations  in  k;  very  similar 
computations  would  apply  for  other  reasonable  assumptions.  Criterion 
variability  sufficient  to  produce  a  30%  reduction  in  d‘  will  cost  the 
subject  only  .05,  08,  and  .08  of  a  cent,  for  a  one-cent  difference 
between  the  value  of  right  and  of  being  wrong;  these  numbers  all  correspond 
to  RELs  of  less  than  10%.  If  it  costs  so  little  to  let  his  criterion  vary 
by  one  standard  deviation  or  more,  why  should  a  subject  bother  to  hold  it 
constant?  Nor  will  he  lose  much  sleep  over  the  plight  of  the  poor 
experimenter',  who  thinks  d1  is  1.4  when  it  is  actually  2.0. 
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III.  Consequences  for  research,  in  perception  and  elsewhere 


We  have  spent  many  pages  discussing  flat  maxima  by  means  of 
examples  but  we  never  answered  the  obvious  question:  How  flat  is  •‘Hat? 
We  might  choose  some  criterion  of  flatness— and  indeed  our  examles 
suggest  candidates.  But  without  behavioral  evidence  of  the  effects  of 


flatness  and  of  steepness,  any  such  choice  would  be  arbitrary.  We  might 
instead  define  flatness  by  means  of  its  behavioral  results,  and  let  the 
experimental  literature  tell  us  i:ow  flat  i*  flat.  But  the  literature 


is  silent;  the  obvious  experiments  haven't  been  done.  So  we  choose  to 
leave  the  definition  of  flatness  to  your  intuition— and,  we  hope,  to  the 
data  that  our  sad  story  leads  you  to  collect. 


We  are  both  decision  theorists,  and  much  of  the  preceding  discussion 
shows  the  feebleness  of  decision  theory.  A  natural  conclusion  for  us  is  to 
order  two  steins  of  hemlock.  While  we  sip,  we  can  amuse  ourselves  by 
observing  the  asymmetry  between  those  who  wish  to  use  decision  theory  and 
those  who  worry  about  the  fact  that  they  and  others  aren't  using  it.  Those 
who  wish  to  use  decision  theory,  whether  as  a  basis  for  experimental  design 
or  as  a  practical  tool  in  real  contexts,  should  be  seriously  disconcerted 
by  our  whole  line  of  reasoning,  since  our  fundamental  conclusion  is  that 
decision-theoretical  niceties,  such  as  eliciting  exactly  appropriate  values 
of  probability  or  payoff,  are  unimportant.  An  escape  from  this  conclusion 
may  be  that  E V,  rather  than  REL,  is  what  really  counts,  and  a  1%  decrease 
of  E V  in  a  $10  decision  is  still  a  $10'  loss.  That  thought  helps  consultants 
more  than  it  does  experimenters. 
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Those  who  worry  about  the  obviously  non-optimal  behavior  of  decision¬ 
makers,  and  wonder  how  the  world  manages  to  hold  together  in  spite  of  human 
inefficiency,  should  look  at  the  previous  pages  with  quite  different  eyes. 
Our  argument  is  readily  inter pretable  as  a  theory  about  the  answer  to  that 
question—and  a  persuasive  one.  The  world  holds  together  and  functions  as 
well  as  it  does  because  major  stupidities,  inefficiences,  selfishnesses, 
and  the  like  on  the  part  of  decision-makers  produce  only  minor  losses  in 
EV.  The  physical  facts  of  life  are  typically  far  more  important  than  the 
subtleties  of  human  decision-making  in  controlling  the  EV  of  an  action,  for 
two  intertwined  reasons.  First,  the  outcome  of  any  significant  decision 
depends  on  Nature  (or  chance,  or  the  opponent,  or  whatever)  as  well  as  on 
the  act  chosen.  A  good  decision  can  lead  to  a  bad  outcome,  and  often  does. 
Second,  the  flatness  of  EV  functions  means  that  the  best  decision  may  be 
hard  to  discover. 

Moreover,  since  wide  variations  in  decision  lead  to  similar  Eys, 
men  are  encouraged  to  attribute  success  to  their  own  insight  in  varying 
their  decisions,  rather  than  to  chance.  (On  this  state  of  mind  rests  the 
state  of  Nevada.)  We  attribute  success,  in  ourselves  or  in  others,  to  good 
management  rather  than  to  good  luck— and  then  wonder  what  is  wrong  when  our 
luck  changes. 

The  entire  preceding  discussion  should  be  immensely  reassuring  to 
those  who  worry  about  the  fate  of  man  and  the  vagaries  ov  politics.  But  it 
does  not  speak  to  the  problems  of  the  researcher  on  perception  who,  having 
been  offered  and  having  accepted  decision  theory  as  a  major  research  tool, 
now  finds  out  that  it  is  a  feeble  one. 
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Our  comfort  for  him,  if  not  cold,  is  no  better  than  luke-warm.  we 
do  not  believe  he  has  any  alternative  to  decision-theoretical  experi^ntal 
designs.  Validation  by  cupidity,  though  less  precise  than  you  and  we 

mi9ht  1,ke>  fS  the  °nl*  lntellectually  acceptable  form  of  validation  „e 
Know  of.  And  validation  by  gentlemen' i  agreement,  its  only  serious 
competitor,  is  no  validation  at  all.  (Except,  perhaps,  for  in-groups  of 
seeing  gentlemen.)  And  validation  by  convergent  operations  is  simply  a 
more  complicated  and  elaborate  form  of  validation  by  gentlemen's  agreement 
unless  those  converging  operations  give  some  good  reason,  other  than 

cooperativeness  and  good-followship,  for  subjects  to  search  their  souls 
and  report  their  true  opinions. 

But  we  do  believe  that  perceptual  experiments  built  on  decision- 
theoretical  ideas  can  be  Improved,  indeed,  „e  have  in  this  paper  presented 
the  tools  we  regard  as  most  useful  for  making  such  Improvements.  The 
experimenter  should  know  how  effective  his  payoff  arranges  can  be,  and 

Sta,d  ^  “  le8St  aS  ™Ch  improving  them  as  to  improving 

his  stimulus-generating  apparatus. 

A.  What  to  do:  increasing  the  effectiveness  of  costs  and  payoffs 

Experiments  that  use  costs  and  payoffs  are  gambling  experiments; 
the  key  to  improving  them  is  stipulation  of  their  gambling  characteristics 
The  experimenter  will  typically  know  exactly  what  his  costs  and  payoffs 
be  is  likely  to  know  somewhat  less  about  the  prior  probabilities  affecting 
subjects'  choices,  and  considerably  less  about  posterior  probabilities.  But 
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he  at  least  knows  a  lot  about  the  variables,  such  as  stimulus  parameters, 
that  affect  posterior  probabilities. 

The  graph  of  the  EV  function  over  measures  representing  responses 
or  response  sets  characterizes  the  effectiveness  of  a  payoff  function,  other 
experimental  parameters  held  constant.  The  experimenter  wants  to  make  that 
graph  as  steep  around  its  maximum  as  he  can  manage. 

The  most  obvious  and  most  successful  way  of  doing  this  is  simply  to 

increase  the  absolute  values  of  both  costs  and  payoffs.  This  leaves  REL 

unaffected,  but  strongly  affects  EV— and  no  one  pays  off  in  RELs.  The  main 

problem  in  doing  so  is  that  large  costs  and  payo'fs  introduce  money-management 

problems  into  the  experiment.  However,  such  problems  can  at  least  be 

palliated;  often,  they  can  be  solved.  For  an  example  of  how  far  one  can  go 

with  college  student  subjects,  see  Swensson  and  Edwards  (1971)  and  Swensson 

(1968).  The  latter  experiment  used  costs  and  payoffs  up  to  $10  for  a  single 
response. 


In  money  management  of  gambling  experiments,  one  principle  outweighs 
all  Others:  cheat!.  Adjusting  EV  to  come  out  right  is  no  problem,  no  matter 
how  large  the  stakes.  But  increasing  stakes  increases  the  variance  of  total 
earnings,  and  that  is  the  problem.  An  experimenter  who  Is  a  careful  cheater 
will  arrange  his  experimental  conditions  so  that  most  subjects  win  less  than 


they  should— and  will  add  further  stimuli 
to  bring  them  up  to  the  desired  pay  rate. 


at  the  end  of  the  experiment  designed 
In  perceptual  experiments,  this  is 


especially  easy,  since  manipulations  that  make  the  subject  right  nearly  100* 
of  the  time  are  ridiculously  easy  to  perform  and  hard  to  detect.  Inflexible 


r!,les  about  the  minimum  rate  of  pay  for  each  subject  will  help  get  such 
research  strategies  past  Human  Subject  Committees. 

If  the  experimenter  is  concerned  about  steepening  the  REL  as  well 
as  the  EV  function,  he  faces  a  more  difficult  problem.  Equation  14  helps-- 
a  little. 

Variation  in  the  physical  values  of  stimulus  will  produce  variation 
in  posterior  probabilities;  to  some  extent  this  helps  control  the  distribution 
of  over  trials,  and  that  in  turn  enters  into  the  sharpness  or  flatness  of 
the  REL  function.  Translated  out  of  the  mathematics,  this  says  that  the 
experimenter  can  steepen  the  REL  function  considerably  by  mixing  up  his 
stimuli.  Uniformly  hard-to-discriminate  stimuli  (in  a  discrimination 
experiment)  will  lead  to  a  peaked  function  and  so  to  a  flatter  REL 
function.  Whether  this  prescription  for  peaking  up  the  REL  function 
conflicts  with  the  experimenter's  wish  to  get  as  much  information  as 
possible  from  each  trial  of  his  experiment  will  depend  on  the  experiment. 

Often,  the  conflict  will  exist  and  a  compromise  will  be  called  for.  As 
has  already  been  pointed  out,  skewed  payoff  matrices  are  a  poor  idea 
unless  they  are  balanced  by  prior  distributions  skewed  the  other  way. 

Extreme  priors  make  the  EV  function  more  skewed,  but  probably  cost  too 
much  in  reducing  the  informational  content  of  the  experiment. 

In  signal  detection  experiments  of  the  yes-no  type,  how  does  d' 
relate  to  EV?  Figure  6  shows  that  different  values  of  d'  will  result  In 
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differences  in  the  maximum  attainable  EV,  and  therefore  in  the  flatness 
of  the  EV  function  over  values  of  k  or  B.  For  symmetric  situations,  and 
assuming  normal  distributions,  a  d'  between  2  and  3  is  best. 

Variability  in  cutoffs  flattens  the  EV  function.  Any  procedure  that 
steepens  the  EV  function  will  of  course  steepen  the  function  resulting  from 
a  specified  cutoff  variability.  In  addition,  cutoff  variability  may  be 
influenceable  by  instructions,  and  will  certainly  be  a  function  of  EV 
variance;  the  more  EV  a  subject  stands  to  lose  by  varying  the  cutoff, 
the  less  likely  he  is  to  do  so. 

Not  much  can  be  done  for  probability  estimates  or  for  sequential- 
sampling  experiments,  other  than  raising  the  stakes. 

Our  crucial  point  here  is  that  the  experimenter  should  look  at  the 
appropriate  function  relating  EV  or  REL  to  both  optimal  and  other  behavior, 
and  use  that  function  as  a  tool  of  experimental  design.  He  can  also  use  it 
as  a  tool  of  data  interpretation  if  he  is  willing  to  make  some  fairly 
plausible  and  straightforward  assumptions  about  what  the  subject  has  done 
in  the  face  of  the  experimental  conditions. 

An  important  way  in  which  an  experimenter  can  improve  his  experiments 
is  to  measure  the  extent  to  which  the  payoffs  he  intends  to  use  in  fact  lead 
to  optimal  or  near-optimal  behavior.  For  instance,  in  a  signal  detection 
experiment  he  can  use  known  stimuli,  such  as  numbers  sampled  from  one  of 
two  normal  distributions,  in  preliminary  calibration  sessions,  along 
with  the  payoff  structure  intended  for  the  main  experiment.  This  procedure 
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would  at  least  give  the  experimenter  some  idea  of  how  much  criterion 
variability  and  suboptimai  responding  to  expect.  It  might  also  suggest 
modifications  of  the  payoff  structure. 

In  experiments  in  which  trial-to-trial  fluctuations  of  stimuli 
can  be  physically  measured  (e.  g.  by  means  of  a  microphone  in  or  near 
the  ear),  they  should  be.  Such  measurements  can  suggest  inferences  about 
the  nature  of  the  effective  stimulus  on  this  trial,  and  thus  inferences 
about  wivt  the  subject  is  actually  doing. 

B.  What  to  do:  interpreting  experiments 

Typical  payoffs  in  perceptual  experiments  have  been  a  few  cents,  or 
even  fractions  of  cents  or  points.  This  does  make  management  easier--but 
we  hope  that  by  now  you  are  chewing  your  fingernails  about  the  flatness  of 
the  maxima  that  result. 

If  -he  subject's  economic  prospects  are  essentially  unaffected  by 
what  ne  does,  what  will  he  do?  He  might  behave  randomly,  especially  if  he 
doesn't  know  he  is  doing  so.  Often,  other  facts  about  the  experiment  imply 
non-* lonetary  payoffs,  and  the  subject  is  likely  to  pay  most  attention  to  these. 
He  likes  to  be  right,  so  he  may  not  respond  well  to  asymmetric  payoffs.  He 
may  minimize  effort,  perhaps  by  as  extreme  a  procedure  as  ignorning  the  stimulus 
or  perhaps  by  hunting  for  some  simple  strategy.  He  may  look  for  hidden  meanings 
in  the  experiment,  and  so  develop  self-instructions  ard  idiosyncratic  response 
strategies.  (The  famous  probability  matching  phenomenon  in  repetitive-birary- 
choitc  experiments,  to  the  extent  th'jt  it  occurs  at  all,  seems  to  be  a  result 
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of  this  kind  of  self-instruction  to  vary  responses.)  Or  he  may  just  be 
bored. 

We  wish  we  could  cite  a  string  of  experiments  all  showing  the  effects 
of  flat  maxima.  We  can't.  The  difficulty  is  that  most  perceptual  (and 
other)  experiments  are  not  designed  to  disentangle  such  decision  variables 
from  the  perceptual  variables  being  studied.  The  preceding  section  contained 
some  comments  about  how  to  avoid  this  confounding,  but  the  existing  literature 
contains  few  instances  of  successful  avoidance. 

The  effect  of  flat  maxima  may  be  to  produce  random  errors,  ;;ystematic 
errors,  or  both.  Several  signal  detection,  recognition,  and  discriminatic^. 
tasks  show  that  subjects  adopt  response  strategies  that  lie  between  optimal 
and  error  minimizing  strategies.  In  signal  detection  experiments.  Green 
(1960),  and  Swets,  Tanner,  and  Birdsall  (1961)  found  that  subjects  tended 
to  deviate  from  optimal  cutoffs  in  the  direction  of  making  both  responses 
more  nearly  equally  often.  Green  concluded  that  "The  way  In  which  the 
expected  value  changes  for  various  criterion  levels  is  the  crux  of  the 
problem;"  that  is,  that  he  had  flat-maximum  trouble.  Others  have  proposed 
that  subjects  regard  such  tasks  as  being  sensorily  oriented,  or  that  subjects 
do  not  in  fact  maximize  E V.  These  hypotheses  cannot  be  studied  without 
larger  payoffs  and  steeper  EV  functions.  Uhlela  (1966)  also  reported 
nonoptimal  decision  criteria  in  an  experiment  requiring  recognition  of 
tilted  lines.  Again,  the  subjects  tended  toward  an  error  minimizing  strategy. 
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Similar  findings  have  been  reported  for  other  tasks  by  Uhl  el  a  and  Schmaltz 
(1966)  and  by  Lleblich  and  Llebllch  (1969a,  1969b). 

Virtually  all  of  these  experiments  combine  skewed  payoff  matrices 
with  quite  difficult  discriminations.  This  produces  a  probability  distribu¬ 
tion  over  £  that  is  peaked  at  .5.  Skewed  payoff  matrices  produce  optimal 
decision  rules  such  that  p  is  well  away  from  .5.  As  arguments  above  show, 
such  procedures  lead  to  very  flat  maxima  in  the  region  of  p*.  This  point 
implies  an  explanation  of  Uhlela  and  Schmaltz's  (1966)  findings  that 
presentation  rates  (priors)  had  a  larger  effect  on  decision  criteria  than 
did  costs  and  payoffs.  Changing  costs  and  payoffs  only  controls  the 
location  of  p  .  But  changing  the  prior  both  changes  the  location  of  the 
criterion  (if  it  is  defined  in  likelihood  ratio  or  related  ways)  and  changes 
the  flatness  of  EV  over  other  criteria  in  the  region  of  p*. 

Criterion  variability  is  very  important  to  the  interpretation  of 
signal  detection  experiments.  The  formal  arguments  summarized  in  Table  III 
suggested  that  abundant  amounts  of  it  should  be  present  in  typical 
experiments.  Hammer ton  (1970),  using  a  simulated  signal  detection  task  in 
which  subjects  made  inferences  about  parameters  of  normal  distributions,  showed 
that  they  did  not  adopt  stable  criteria.  Galanter  and  Holman  (1967)  also 
reported  that  subjects  used  decision  strategies  inconsistently  when  faced 
with  different  payoff  matrices.  Where  they  have  been  studied,  the  implications 
of  Table  III  seem  to  be  experimentally  confirmed.  We  conclude  that  most 
values  of  d'  reported  in  the  literature  are  substantially  depressed— 
sufficiently  so  that  reported  values  of  d'  in  most  experiments  should  be 
regarded  only  as  lenient  lower  bounds  on  true  values. 


liawMifiifesaiif 


-  41  - 


Can  this  distressing  picture  be  alleviated  by  increasing  costs 
and  payoffs?  Some  data  report  invariance  of  d'  for  different  payoff 
structures  and  levels  of  motivation  (for  a  summary  see  Green  and  Swet<>, 

1966;  for  specific  studies,  see  Swets  and  Sewall,  1967,  and  Lukaszewski  and 
Elliott,  1962).  On  the  other  hand  Watson  and  Clopton  (1969)  and  Calfee  (1970) 
found  noticeable  effects  of  costs  and  payoffs  on  the  detection  rate  and  on  d' 
respectively. 

No  experiments  have  studied  the  effect  of  changes  in  costs  and 
payoffs  on  the  effects  of  proper  sctrlng  rules  for  probability  estimation, 
so  far.  The  scoring  rule  maxima  are  so  flat  that  we  are  very  pessimistic— 
yet  the  studies  need  to  be  done.  In  their  absence,  there  seems  to  be  little 
point  in  using  proper  scoring  rules  to  improve  probability  estimates— except 
in  the  sense  that  the  scoring  rules  are  really  instructions  about  the  meaning 
of  probability  estimates,  and  so  may  have  merits  not  dependent  on  their 
property  of  rewarding  most  the  man  who  makes  the  most  truthful  estimate. 
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Figure  Captions 


Fig.  1.  Maximal  EV  as  a  function  of  the  actual  probability  for 
a  quadratic  proper  scoring  rule. 

Fig.  2.  EV  as  a  function  of  probability  for  two  actions. 

Fig.  3.  EV  as  a  function  of  the  cutoff  probability  for  two  different 
payoff  matrices. 

Fig.  4.  REL  as  a  function  of  the  cutoff  probability  for  two  different 
payoff  matrices. 

Fig.  5.  Hypothetical  conditional  probability  distributions  of  the  random 
observation  variable  Y. 

Fig.  6.  EV  as  a  function  of  the  decision  criterion  for  a  symmetric  payoff 
matrix  and  for  different  values  of  d'. 

Fig.  7.  REL  as  a  function  of  the  decision  criterion  for  a  symmetric 
payoff  matrix  and  for  different  values  of  d'. 

Fig.  8.  EV  as  a  function  of  the  upper  bound  of  a  probability  distribution 
over  the  decision  rule. 
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PROBABILITY  THAT  STIMULUS  IS 
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EXPECTED  LOSS  (%) 
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